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[57] ABSTRACT 

A method, apparatus, and article of manufacture for locating 
web pages from a network server. A count of retrievals of a 
web page is accumulated and the accumulated count and an 
address for the web page are stored in a record of a history 
log database at the network server. A multiple reference 
hotlisl is formatted from the records in the history log 
database, wherein the multiple reference hotlist comprises a 
list of addresses for web pages retrieved from the records 
and the list is sorted by the accumulated counts retrieved 
from the records. The multiple reference hotlist is then 
displayed for a user. 

19 Claims, 3 Drawing Sheets 
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MULTIPLE REFERENCE HOTLIST FOR 
IDENTIFYING FREQUENTLY RETRIEVED 
WEB PAGES 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

This invention relates in general to web servers, and more 
particularly, to a multiple reference hotlist generated from a 
history log. 

2. Description of Related Art 

As the popularity and usefulness of the Internet grows, 
and more of the general public are able to access the Internet 
both at home and at work, the usefulness of the vast number 
of web pages and Uniform Resource Locators (URLs) 
becomes difficult to manage. 

The typical way for a user to access a web page or web 
site is to use a search engine, or a list of web pages compiled 
by another user as "popular," to find a web page or site of 
interest, and then review the web page or use links in the 
web page to find other web pages of interest. Once a web 
page is found, the user can create a bookmark for the web 
page in their browser to recall the location of the page at 
some later date. 

However, there are hundreds of thousands of websites on 
the Internet, with millions of web pages located at those web 
sites, and finding the web pages that are popular is difficult. 

Once a user arrives at a particular web site, the user must 
traverse a typically static navigation structure, set up by the 
website designer, to get to a web page of interest. This 
process is slow and sometimes confusing, and requires 
additional time for each user to traverse from the web site to 
the web page of interest. 

It can be seen, then, that there is a need for a better way 
to find popular web pages and web sites on the Internet. It 
can also be seen that certain pages should be marked as "hot 
spots" for all users. It can also be seen, then, that there is a 
need for finding popular web pages at a particular web site. 
It can also be seen, then, that there is a need to expedite and 
clarify the path from the web site to a given web page. 

SUMMARY OF THE INVENTION 

To minimize the limitations in the prior art described 
above, and to minimize other limitations that will become 
apparent upon reading and understanding the present 
specification, the present invention discloses a method, 
apparatus, and article of manufacture for locating web pages 
from a network server. A count of retrievals of a web page 
is accumulated and the accumulated count and an address 
for the web page are stored in a record of a history log 
database at the network server. A multiple reference hotlist 
is formatted from the records in the history log database, 
wherein the multiple reference hotlist comprises a list of 
addresses for web pages retrieved from the records and the 
list is sorted by the accumulated counts retrieved from the 
records. The multiple reference hotlist is then displayed for 
a user. 

BRIEF DESCR1F110N OF THE DRAWINGS 

Referring now to the drawings in which like reference 
numbers represent corresponding parts throughout: 

FIG. 1 is a block diagram of an exemplary hardware 
environment of the preferred embodiment of the present 
invention, and more particularly, illustrates a typical distrib- 
uted computer system; 
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FIG. 2 shows one possible structure of the history log 
database according to the present invention; and 

FIG. 3 is a flowchart illustrating the logic performed in 
creating the history log database and the multiple reference 
5 hotlist according to the present invention. 

DETAILED DESCRIPTION Op THE 
PREFERRED EMBODIMENT 

-jQ In the description of the preferred embodiment, reference 
is made to the accompanying drawings which form a part 
hereof, and in which is shown by way of illustration the 
specific embodiment in which the invention may be prac- 
ticed. It is to be understood that other embodiments may be 

j5 utilized as structural changes may be made without depart- 
ing from the scope of the present invention. 

Overview 

When an Internet user retrieves web pages, they use a 
20 browser to transmit HyperText Transfer Prt)tocol (HTTP) 
commands from their computer to a web server executed by 
a connected computer. In turn, the web server responds with 
a HyperText Mark-up Language (HTML) (or other 
formatted) page that is transmitted to the browser for display 
25 to the user. 

Typically, users access web pages by using a search 
engine (Yahoo!™ , AltaVista™, etc.) to find pages regarding 
a topic of interest. If the web page is of some interest to the 
user, they "bookmark" the HTTP Uniform Resource Locator 
(URL) for that page in their browser in order to easily find 
the web page in the future. 

However, individual users have few resources to deter- 
mine what web pages are the most popular. Generally, 
popular web page must be found through trial and error by 
the individual user, or through lists manually created by 
other individual users or service providers (e.g., Yahoo!™ 
weekly picks). Web page popularity is not available to 
individual users because there is no centralized history 
logging of the number of retrievals ("hits") of a given web 
page, nor is there any way to create a "hotlist" from such 
centralized logging. 

For example, one web page of recent popularity is iden- 
tified by the URL "http://www.mars.jpl.nasa.gov" and con- 

45 tains pictures obtained from the Mars Pathfinder spacecraft. 
It was reported that this web page (and others associated 
with it) received over 1 million "hits" in one week (i.e., was 
retrieved 1 million times by users in one week). The web 
page contains hyperlinks to other web pages that display the 

5Q number of hits allowed per site, e.g., the web page identified 
by the URL "http://mars,sgi,com" can accept 20,000,000 
hits per day without overloading the web server. 

Each user that visited the Mars Pathfinder web page either 
obtained the information by using a search engine or from 

55 another user or from an external source. However, in the 
present invention, by generating a "multiple reference hot- 
list" from a history log that accumulates the total number of 
retrievals made of any number of different web pages, 
popular web sites can be accessed directly without the need 

60 of a search engine. 

Such a multiple reference hotlist can be useful both to 
users of the Internet in a local and a global sense. For 
example, a hotlist of popular web pages can serve as a guide 
to web pages of local, regional, national, or global interest 

65 in a given subject. In another example, the multiple refer- 
ence hotlist can be used by both novice and experienced 
users for traversing the Internet. Further, some web pages 
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are "hot spots" that any user would want to access quickly, 
even though the user would normally have to follow a long 
chain of links or accesses to find them. The present invention 
will also provide particular usefulness to websites that offer 
descriptions of items for sale from a constantly changing 
inventory. These "hot spots" will change over time and are 
dependent on the set of users and the application. 

There are other considerations that also relate to the 
popularity of a given web page. Some web pages are visited 
by mistake, or for a very short time, which would give a false 
popularity "score" to those web pages. The multiple refer- 
ence hotlist can take into account these and other 
parameters, e.g., the length of time that the web page was 
visited, the length of time between retrievals of a web page 
and another web page, etc., so that a true popularity "score" 
can be used to sort the multiple reference hotlist. 

Hardware Environment 

FIG. 1 is a block diagram of an exemplary hardware 
environment of the preferred embodiment of the present 
invention, and more particularly, illustrates a typical distrib- 
uted computer system 10, wherein client computers 12 are 
connected via a network 14 to server computers 16. A typical 
combination of resources may include clients 12 that are 
personal computers or workstations, and servers 16 that are 
personal computers, workstations, minicomputers, and/or 
mainframes. These network 14 preferably comprises the 
Internet, although it could also comprise intra-nets, LANs, 
WANs, SNA networks, etc. 

Each of the computers, be they client 12 or server 16, 
generally include, inter alia, a processor, random access 
memory (RAM), data storage devices , data communications 
devices, monitor, user input devices, etc. Those skilled in the 
art will recognize that any combination of the above 
components, or any number of different components, 
peripherals, and other devices, may be used with the client 
12 and server 16. 

Each of the computers, be they client 12 or server 16, 
operate under the control of an operating system (OS), such 
as OS/390, AIX, UNIX, OS/2, Windows, etc. The operating 
system is booted into the memory of the computer for 
execution when the computer is powered -on or reset. In tura, 
the operating system then controls the execution of one or 
more computer programs by the computer. 

In the present invention, the operating system (not shown) 
of the client 12 controls the execution of a web browser and 
the operating system 18 of the server 16 controls the 
execution of a web server 20. The web browser is typically 
a computer program such as IBM*s Web Explorer, 
NetScape, Internet Explorer, Mosaic, etc. The web server 20 
is typically a computer program such as IBM's HTTP 
Daemon'^" or other World Wide Web (WWW) daemon. 

The present invention is usually (although not 
necessarily) implemented by a computer program 22 and its 
associated history log database 24 that are executed, 
interpreted, and/or stored in at least one of the servers 16 
under the control of that server's operating system 18. This 
computer. program 22, which may be a separate computer 
program 22 or may be implemented within the operating 
system 18 or the web server 20, and its associated database 
24 causes the server 16 to perform the desired functions as 
described herein. 

The operating system 18, web server 20, and computer 
program 22 are comprised of instructions which, when read 
and executed by the server 16, causes the server 16 to 
perform the steps necessary to implement and/or use the 
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present invention. Generally, the operating system 18, web 
server 20, computer program 22, and/or database 24 are 
tangibly embodied in and/or readable from a device, carrier, 
or media, such as a memory, data storage device, and/or data 

5 communications device connected to the server 16. Under 
control of the operating system 18, the web server 18, 
computer program 22, and/or database 24 may be loaded 
&om the memory, data storage device, and/or data commu- 
nications device into the memory of the server 16 for use 

10 during actual operations. 

Thus, the present invention may be implemented as a 
method, apparatus, or article of manufacture using standard 
programming and/or engineering techniques to produce 
software, firmware, hardware, or any combination thereof. 

35 The term "article of manufacture" (or alternatively, "com- 
puter program product") as used herein is intended to 
encompass a computer program accessible from any 
computer-readable device, carrier, or media. Of course, 
those skilled in the art will recognize many modifications 

20 may be made to this configuration without departing from 
the scope of the present invention. 

Those skilled in the art will recognize that the exemplary 
environment illustrated in FIG. 1 is not intended to limit the 
present invention. Indeed, those skilled in the art will 
recognize that other alternative hardware environments may 
be used without departing from the scope of the present 
invention. 

Data Structure 

■'^ FIG. 2 shows one possible structure of the history log 
database 24 according to the present invention. The database 
may be a table comprised of rows and columns, although 
other structures may also be used. In the database 24 
illustrated herein, each row of the table typically includes a 
name or title for the web page 26, an HTT? URL 28 for the 
web page, and a counter 30 that is incremented every time 
the web page is accessed. Rows are created for each web 
page accessed by a user and are updated each time the web 
page is accessed. 

The records m the history log database 24 are used to 
generate the multiple reference hotlist of the present 
invention, wherein the hotlist displays the title 26, URL 28, 
and counter 30 for the most popular web pages. In creating 
the multiple reference hotlist from the history log database 
24, the rows would preferably be sorted by the value in the 
counter 30 to indicate the popularity for each web page. 

In addition, a cutoff value could be established for the 
counter 30 to eliminate less-popular web pages from the 

50 multiple reference hotlist. For example, if a cutoff value of 
100,000 were used with the example rows of FIG. 2, the 
Yahoo!™ Finance web page would not be used to create the 
multiple reference hotlist. Depending on the cutoff value 
used, different multiple reference hotlists would be dis- 

55 played to the user. 

Each row of the history log database 24 could also include 
a timestamp to indicate how long the web page was 
displayed, in order to provide a "quality of visit" parameter 
for the database 24. This quality of visit parameter could be 

60 used, inter alia, to allow the database 24 to show a nested 
page as the popular site, rather than a home page. This 
allows the multiple reference hotlist to more accurately 
portray the popular web pages. 

g5 Flowchart 

FIG. 3 is a flowchart illustrating the logic performed in 
creating the history log database 24 and the multiple refer- 
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ence hotlist according to the present invention. As indicated storing the accumulated count and an address for each 

above, this logic could be performed by the computer ' web page in a record of a history log database associ- 

program 22, the web server 20, and/or the operating system ated with that web page and stored by the network 

18 in various implementations. However, for the purposes of server; 

illustration only, the logic will be described in conjunction 5 generating a multiple reference hotlist by selecting web 

with the computer program 22. pages based on their accumulated counts, wherein the 

Block 32 represents the computer program 22 waiting for multiple reference hotlist comprises a list of addresses 

an event to occur. After an event occurs, control is trans- for web pages retrieved from the records and wherein 

ferred to Blocks 34-52. the list is sorted by the accumulated counts retrieved 

Block 34 is a decision block that represents the computer records; and 

program 22 determining whether the event was a user's displaying the multiple reference hotlist for a user, 

request for a web page. If so, control is transferred to Block 2. The method of claim 1, wherein the step of accumu- 

36; otherwise, control is transferred to Block 44. Block 36 lating further comprises the step of accumulating the count 

represents the'computer program 22 attempting to retrieve a of retrievals whenever a user requests the web page, 

record for the requested page from the history log database 3. The method of claim 1, wherein the step of storing 

24. Block 38 is a decision block that represents the computer further comprises the step of storing a title for the web page, 

program 22 determining whether the record was found in the 4. The method of claim 3, wherein the step of displaying 

history log database 24. Ifnot, control is transferred to Block further comprises the step of displaying the title for each 

40; otherwise control is transferred to Block 42. Block 40 web page in the multiple reference hotlist, 

represents the computer program 22 creating the record for 5. The method of claim 1, wherein the step of accumu- 

the web page in the history log database 24. Block 42 20 lating further comprises the step of accumulating a time 

represents the computer program 22 incrementing the between retrievals of the web page and another web page, 

counter 30 in the record of the history log database 24 and the step of storing further comprises the step of storing 

associated with the requested web page. Thereafter, control the accumulated time in the record of the history log 

is transferred back to Block 32. database. 

Block 44 is a decision block that represents the computer 25 6. The method of claim 1, wherein the step of formatting 

program 22 determining whether the event was a request by further comprises the step of limiting the list to records that 

the user to display the multiple reference hotlist. If so, have an accumulated count that exceeds a cutoff value, 

control is transferred to Block 46; otherwise, control is 7. A computerized apparatus for identifying web pages, 

transferred to Block 52. Block 46 represents the computer comprising: 

program 22 retrieving one or more records from the history 30 a network server; 

log database 24. Block 48 represents the computer program a computer program, executed by the network server, for 

22 formatting the retrieved records from the database 24 into accumulating a count of retrievals of one or more web 

a multiple reference hotlist that can be presented to the user pages, wherein the count is incremented for each web 

for display. Block 50 represents the computer program 22 page each item that web page is retrieved by any user, 

transmitting the multiple reference hotlist to the user for 35 for storing the accumulated count and an address for 

display. Thereafter, control is transferred to Block 32. each web page in a record of a history log database 

Block 52 represents the computer program 22 performing associated with that web page and stored by the net- 
other processing as required. Thereafter, control is trans- work server, for generating a multiple reference hotlist 
ferred to Block 32. by selecting web pages based on their accumulated 
Conclusion counts wherein the multiple reference hotlist comprises 

a list of addresses for web pages retrieved from the 

In conclusion, the present invention discloses a method, records and wherein the list is sorted by the accumu- 

apparatus, and article of manufacture for locating web pages lated counts retrieved from the records, and for dis- 

from a network server. A count of retrievals of a web page playing the multiple reference hotlist for a user, 

is accumulated and the accumulated count and an address 45 xhe apparatus of claim 7, wherein the computer pro- 

for the web page are stored in a record of a history log gram further comprises means for accumulating the count of 

database at the network server. A multiple reference hotlist retrievals whenever a user requests the web page, 

is formatted from the records in the history log database, 9^ The apparatus of claim 7, wherein the computer pro- 

wherein the multiple reference hotlist comprises a list of gram further comprises means for storing a title for the web 

addresses for web pages retrieved from the records and the 50 page. 

list is sorted by the accumulated counts retrieved from the 10, The apparatus of claim 9. wherein the computer 

records. The multiple reference hotlist is then displayed for program further comprises means for displaying the title for 



a user. 



each web page in the multiple reference hotlist. 

The foregoing description of the preferred embodiment of H. The apparatus of claim 7, wherein the computer 

the invention has been presented for the purposes of illus- 55 program further comprises means for accumulating a time 

tration and description. It is not intended to be exhaustive or between retrievals of the web page and another web page 

to hmit the invention to the precise form disclosed. Many and means for storing the accumulated time in the record of 

modifications and variations are possible in light of the the history log database. 

above teaching. It is intended that the scope of the invention 12. The apparatus of claim 7, wherein the computer 

be limited not with this detailed description, but rather by the 60 program further comprises means for limiting the list to 

claims appended hereto. records that have an accumulated count that exceeds a cutoff 

What is claimed is: value. 

1. A method for identifying web pages from a network 13. An article of manufacture comprising a computer 

server, comprising the steps of: program carrier readable by a network server and tangibly 

accumulating a count of retrievals of one or more web 65 embodying one or more computer programs executable by 

pages, wherein the count is incremented for each web the network server to perform method steps for identifying 

page each time that web page is retrieved by any user; web pages, the method steps comprising: 
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accumulating a count of retrievals of one or more web 
pages, wherein the count is incremented for each web 
page each time that web page is retrieved by any user; 

storing the accumulated count and an address for each 
web page in a record of a history log database associ- 
ated with that web page and stored by the network 
server; 

generating a multiple reference hotlist by selecting web 
pages based on their accumulated counts, wherein the 
multiple reference hotlist comprises a list of addresses 
for web pages retrieved from the records and wherein 
the list is sorted by the accumulated counts retrieved 
from the records; and 

displaying the multiple reference hotlist for a user. 

14. The article of manufacture of claim 13, wherein the 
step of accumixlating further comprises the step of accumu- 
lating the count of retrievals whenever a user requests the 
web page. 

15. The article of manufacture of claim 13, wherein the 
step of storing further comprises the step of storing a title for 
the web page. 

16. The article of manufacture of claim 15, wherein the 
step of displaying further comprises the step of displaying 
the title for each web page in the multiple reference hotlist. 
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17. The article of manufacture of claim 13, wherein the 
step of accumulating further comprises the step of accumu- 
lating a time between retrievals of the web page and another 
web page, and the step of storing further comprises the step 

5 of storing the accumulated time in the record of the history 
log database. 

18. The article of manufacture of claim 13, wherein the 
step of formatting further comprises the step of limiting the 
list to records that have an accumulated count that exceeds 
a cutoff value. 

19. A computer-readable memory for storing a computer 
program that identifies web pages from a network server, 
comprising; 

a data structure stored in the memory of the network 
server, the data structure including an accumulated 
count of retrievals of one or more web pages and an 
address for each web page, wherein a multiple refer- 
ence hotUst is generated from the data structure for 
display to a user, wherein the multiple reference hotlist 
comprises a list of addresses for web pages retrieved 
from the data structure based on their accumulated 
counts and wherein the list is sorted by the accumulated 
counts retrieved from the data structure. 

***** 
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